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Abstract 



1 Introduction 



In the market place, diversification reduces risk and 
provides protection against extreme events by ensur- 
ing that one is not overly exposed to individual oc- 
currences. We argue that diversification is best mea- 
sured by characteristics of the combined portfolio of 
assets and introduce a measure based on the informa- 
tion entropy of the probability distribution for the 
final portfolio asset value. For Gaussian assets the 
measure is a logarithmic function of the variance and 
combining independent Gaussian assets of equal vari- 
ance adds an amount to the diversification. 

The advantages of this measure include that it nat- 
urally extends to any type of distribution and that it 
takes all moments into account. Furthermore, it can 
be used in cases of undefined weights (zero-cost as- 
sets) or moments. We present examples which apply 
this measure to derivative overlays. 



Combining different assets in a portfolio changes 
the return and rislQ characteristics. Diversification 
strategies allow to fine tune for risk appetite and pa- 
rameter ranges, which are usually stipulated in in- 
vestment mandates. 

While diversification is an intuitive concept, there 
is no unique quantitative measure of it. The benefits 
of portfolio diversification clearly stem from the in- 
dependence or the offsetting of asset value changes. 
Hence any good measure should take asset character- 
istics into account in addition to portfolio exposures 
to each asset (e.g., asset weights in the portfolio). 

Here we want to give a brief review of existing 
measures before introducing a new measure based 
on the entropy of the return probability distribution 
(for the total portfolio). This measure naturally ex- 
tends to zero-cost assets (for which weights cannot 
be defined) and we show the diversification benefits 
of some derivative overlays. 



We will leave the precise definition of "risk" here open. 
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2 Existing Measures 

Measures have been suggested (for an overview see for 
example [3]), which fall broadly into two categories: 
just weight based and taking return characteristics 
into account. 

Herfindahl Index A simple and widely used 
weight-based diversification index given by 

N 
i=l 

where the Wi are the weights for the N as- 
sets. This measure does not account for inter- 
dependence between assets and differing risk 
characteristics. 

Weight Entropy One way to interpret portfolio 
weights is to see them as the probability of a 
"randomly"0 choosen currency unit to be in- 
vested in a certain asset. One could then argue 
that the entropy difference between these proba- 
bilities and the uniform distribution is a measure 
of information content and diversification. The 
corresponding measure is the weight entropy 

N 

-^t«ilog(iUi). (2) 
i=l 

This measure also has an intriguing sub-division 
property, which relates the overall entropy to the 
entropy of sub-portfolios and the weights of the 
sub-portfolios. 

E = -Wi log(tUi) + ^ WjEj, (3) 

where wi are the portfolio weights and Ei are 
the entropies of the sub-portfolios. 

Portfolio Diversification Index This measure 
tries to assess how many independent bets 
there are using the eigenvalues of the covariance 

2 Here in the sense of equal likelihood. It might indeed be 
impossible to define randomness consistently. 



matrix of the returns of individual assets mak- 
ing up a combined portfolio (which is usually 
estimated from historic data). It is given by 

M 

2^feA fe -l, (4) 

k=l 

where are the ordered and normalized covari- 
ance eigenvalues. In its original form it does not 
deal with the actual weights assigned to the as- 
sets. However, they can be incorporated by con- 
sidering weighted returns, i.e., using the covari- 
ance matrix of the fractional contributions of the 
assets to the total portfolio performance. 

An important shortcoming of this measure is its 
sole reliance on the covariance matrix, which is 
the second moment of a multi-variate distribu- 
tion. Only for the special case of (multi-variate) 
normality does this take all distributional in- 
formation into account (besides the expected 
value) . 

3 Interpretations and Notation 

We follow the Bayesian interpretation of probabil- 
ity [I] as a measure of degree of believe, which is 
always dependent on subjective background informa- 
tion. The probability of proposition A to be true, 
given that B is true is denoted by P{A\B). A and B 
can be composed of several propositions. / denotes 
the available (subjective) background information. 

Similarly, probability distributions are denoted by 
p{A x \B), where A x is a proposition involving a con- 
tinuous variable x. 

Here our primary interest is in probability distri- 
butions for the prices of financial assets. Such dis- 
tributions can be based on subjective beliefs, or they 
can be market-implied distributions, which can be 
extracted from traded derivative prices [31 [3] . 

4 A Measure based on Entropy 

Information entropy is known as a measure of disper- 
sion in a distribution. In contrast to the variance, it 
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measures the expected information gained per out- 
comeU We want to argue here that the information 
entropy of the probability distribution of the final 
value of the portfolio is a natural measure of diversi- 
fication. Let us define such a diversification measure 
D(A\I) as 

/OO 
p(A a \I)\og(p(A a \I)) da, (5) 
-OO 

which is just the negative of the information entropy. 
Here p{A a \I) is the probability distribution for the fi- 
nal value a of portfolio A. Note that the unit depends 
on the base of the logarithm used — for the natu- 
ral logarithm the diversification will be measured in 
"nats", which differ by a factor l/log(2) from "bits". 

As the probability distributions are subjective, i.e., 
dependent on some subjective background informa- 
tion /, so will be the diversification measure. This 
means that our judgment of diversification might 
change due to new information available. 

One pleasing property of this measure is that it 
does not make any distributional assumptions, but 
can be applied to any type of distribution. Obviously, 
it is in itself a challange to find a suitable probability 
distribution expressing information and beliefs about 
the portfolio returns. 

One consequence of the above is that the mea- 
sure is not just a function of a finite number of mo- 
ments/cumulants, like the variance. Instead D(A\I) 
depends on all moments and remains defined even if 
moments become undefined (like for power-law distri- 
butions). For example, for the Cauchy distribution 

p(A x \I) = — -^—2 (6) 

all moments are undefined, but our diversification 
measure remains well-defined and takes the value 

D(A|J) = -log(47T7). (7) 

3 For a discrete probability distribution the information en- 
tropy is invariant under the exchange of any two probability 
values, i.e., it does not depend on the outcome variable, just 
on the probabilities of the possible outcomes. 



For the simple case of normally distributed final 
portfolio values with variance a 2 one finds 

D(A\I) = -i logger 2 ), (8) 

where A stands for the set of propositions A a that 
the final portfolio value will be a. 

The more peaked the probability distribution, the 
larger the value for the diversification measure. In 
the extreme limiting case of a delta function the mea- 
sure approaches infinity. Hence one could argue that 
cash, which generally has the least uncertainty about 
its future value in currency terms, has the highest 
diversification of all investable assets. 

5 Motivation 

Let us consider the case of two independent assets 
A and B, whose future value is normally distributed 
with equal variance and expected value. Our diversi- 
fication measure for each asset takes the value 

D(A\I) = D{B\I) = -i log^vrecr 2 ). (9) 

Let us now form a combined portfolio C by investing 
the fraction wa and wb of the total portfolio value 
in asset A and B, respectively. The final value of 
portfolio C is then given by 

c = WAa + WBb. (10) 

As A and B are normal, C is normal with variance 

(w 2 A + w%)a 2 (11) 

and the diversification measure for C takes the form 

D(C\I) = D(A\I) - log (Jwl + wl) . (12) 

Hence diversification depends logarithmically on the 
distance to the origin in weight space. This is illus- 
trated in Figure [TJ 

This is a sensible behaviour for a diversification 
measure: for equal variances it is maximized for equal 
weights (in which case it increases by half a bit) and 
the increase due to diversification is independent of 



3 




Figure 1: Diversification benefit for two uncorrelated 
assets of equal variance. The change in diversification 
depends on the distance to the origin in the weight 
space. Each circle corresponds to a fixed change in 
the diversification measure (in bits) and the portfolio 
weights determine its position along the straight line. 
The inset shows the change in diversification measure 
as a function of wx ■ 



asset variances. Furthermore, it is additive - if the 
total diversification is k bits, then we can think of it 
as being the equally weighted mixture of 



2(fc-i) 



(13) 



independent assets (of equal variancence) each with 
diversification j < k. 

This result can easily be generalized to the case 
of differing variance a\ and a 2 B for A and B, respec- 
tively. Using Oq = w\a\ + Wg<jg one finds the result 



D{G\I) 



D(A\I) + D(B\I) 



log 



Here the first term is the (equally weighted) average 



diversification and the second term is the diversifica- 
tion benefit dependent on the weights and variances. 

6 Transformation Properties 

Let the value c of portfolio C be a strong monotonic 
increasing function of the value a of portfolio A, the 
"underlying" . The portfolio value probability distri- 
butions are related by 



y'(.c,„,in=^.i„|/i(^ 1 



(15) 



Substituting this into the definition for the diversifi- 
cation measure one finds 



D(C\I) = D(A\I) 
where 



dap(A a \I)log(A), (16) 



da 



(17) 



is the "future delta" (the delta at the time for which 
the diversification measure is evaluated). 
One special case of this is constant gearing 

c = Act, (18) 

where A > and the diversification benefit is 



D(C\I) = D(A\I)- log(X). 



(19) 



This illustrates that gearing (A > 0), which steepens 
the dependence of the final portfolio value on the final 
asset value, decreases diversification. 

The results above can be generallized to strongly 
monotonic decreasing relationships. If there is no 
strong monotonic relationship then the domain can 
be broken into distinct segments, which are either 
strong monotonic or constant. 

7 Examples 

For the examples below we assume that asset returns 
are log-normal with constant (log-return) variance a 2 
and expected growth factor equal to the risk free rate 
growth e rt . Hence the Black-Scholes formulas can be 
used for the option pricing. 
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7.1 Long dated put 

We consider here the diversification over a one year 
horizon. We augment the asset with a bought put 
option with expiry in two years and strike price E. 
Let r = lyr be the remaining life time of the option 
at the time for which we consider the diversification 
measure. The change of the combined portfolio value 
with the underlying asset is then the sum of future 
"deltas" 

A = e- DT [N(di) — 1] + 1, (20) 
where D is the (continuous) dividend yield and 

^ = M*/E) + {r-D + a >/2)T 

Let us assume here that there is no dividend yield on 
the asset, i.e., D = 0. The diversification benefit is 
then 

/oo 
da p{A a \I) log {Nid^a))) (22) 
-oo 

We evaluate this for different strike prices E. Figure [5] 
shows the resulting diversification benefits for differ- 
ent standard deviations (for the annual log returns). 

For a given fixed variance, if the strike price of the 
put option is low, then the final asset value distri- 
bution is hardly affected and the diversification ef- 
fect diminishes. As the strike price increases more 
"negative" outcomes (asset value below strike price) 
are eliminated as the put-payoff compensates for the 
loss. This is equivalent to a narrowing of the proba- 
bility distribution and an increase of the diversifica- 
tion measure. We note that the diversification ben- 
efit gained with increasing strike price is far more 
substantial for less volatile assets. 

Note that the option considered here still has one 
year left to expiry at the time for which we consider 
the diversification measure. The above still holds, 
with the trend becoming more extreme as there is 
less time to expiry remaining. 

7.2 Put-Spread 

The put spread, being the combination of a bought 
upper put and a sold lower put, is now considered. 




Figure 2: Diversification benefit (considered for one 
year) of a long put position with a two year term. 
The increase in the diversification measure is shown 
for different strikes and volatilities. Here we assumed 
a risk-free rate and expected value of the log-normal 
distribution of 10% per annum. 

The totla portfolio future delta is now 

A = e- DT [N(dx (x,E u )) - N{di{x,Bh))] + 1, (23) 

where E u is the strike of the upper put and E\ is the 
strike of the lower put. 

Figure [3] shows the resulting diversification benefits 
for the 10% put spread as a function of the (upper) 
strike price given different standard deviations (for 
the annual log returns). The shape of the curve re- 
sults from the competition between the increase in 
the diversification benefit from increasing the strike 
price of the upper put as illustrated in Figure [5J and 
the loss of diversification from the lower sold put. 
Note though that these effects are not additive as 
follows from (p~6j) and the non-linearity of the loga- 
rithm. 
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Figure 3: Diversification benefit (considered for one 
year) of a 10% put-spread position with a two year 
term. The increase in the diversification measure 
is shown for different strikes of the upper put and 
volatilities. Here we assumed a risk-free rate and ex- 
pected value of the log-normal distribution of 10% 
per annum. 

7.3 Collar 

The collar is a popular strategy because the downside 
protection is (partially or fully) financed by selling 
upside participation (in form of a sold call). As the 
collar is the combination of a bought put with a sold 
call we have (including the underlying) 

A = e- DT [N(d 1 {x,E p ))-l}~N(d 1 (x,E c )) + l, (24) 

where E p is the strike of the put, and E c the strike 
of the call. 

Figure |4] shows the net diversification benefit 
achieved from a collar in terms of the strike of the 
upper sold call option. The strike of the bought put 
was fixed at 100% and curves for different volatilities 
(of the annualized log-return distribution) are shown. 

As the call strike price approaches the put strike 
price at 100% all uncertainty is removed (it becomes 



Figure 4: Diversification benefit (considered for one 
year) of a collar position with a two year term. The 
increase in the diversification measure is shown for 
different strikes and volatilities. Here we assumed a 
risk-free rate and expected value of the log-normal 
distribution of 10% per annum. 

a synthetic sold future paired with the underlying 
asset) and the diversification benefit increases to in- 
finity. The amount of diversification is notably higher 
in the case of more volatile assets. 

8 Conclusion 

In this paper we advocate that a measure of portfolio 
diversification should be a functional of the (subjec- 
tive) probability distribution for the final portfolio 
value. In this way it automatically incorporates as- 
set exposures and asset characteristics. 

We propose the negative information entropy of 
the probability distribution of the final portfolio value 
as a suitable diversification measure, which takes all 
moments into account. 

This measure can be applied to all types of dis- 
tributions and does not require that asset returns or 
weights are defined. This means that this measure 
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can deal with holdings which can have positive and 
negative values, like derivative structures, in a port- 
folio. 

For the simplistic case of independent, normally 
distributed asset returns, we have demonstrated that 
the negative information entropy is a logarithmic 
function of the portfolio variance. Combining Gaus- 
sian assets of equal variance adds an amount to the 
diversification measure which only depends on the 
asset weights. 

For derivative overlays our result (fT6"|) links the fu- 
ture delta to the diversification benefit. 

For the more realistic example of log-normal final 
asset value probability distributions we have given 
some simple examples illustrating how derivative- 
overlays lead to a diversification benefit, as measured 
by our index. 

As protective derivative overlays narrow the prob- 
ability distribution of the final portfolio value, they 
increase the diversification as measured by our mea- 
sure. 

We find that for a long-dated put option, the di- 
versification rises rapidly with increasing strike price 
for less volatile underlying stocks. For the case of a 
put spread, this index can be used to select the strike 
prices that maximizes the net diversification benefit 
for a given level of volatility. For the case of a col- 
lar, the portfolio diversification benefit achieved in- 
creases dramatically as the sold call strike approaches 
the bought put strike. 



A Understanding Information 
Entropy 

Let us assume that (discrete) events i occur with like- 
lihood pi in a repeated experiment. On observing the 
outcome i the information gained in bits is — log 2 (pi). 

One way to look at this is as follows: if our state 
of knowledge is that we judge an object to be equally 
likely in one of 2" positions then we gain n bits when 
we find out where it actually is. 

As events occur with probability pi the average in- 



formation gain (in bits) is 



(25) 



which is called the information entropy. 

The information entropy has some interesting 
properties: 

• E — if pj — 1 for one j . 

• E takes a maximum value of \og 2 (N) if the N 
possible outcomes are equally likely. 

• if outcome j is really a combination of different 
outcomes j, k then 



E — E' + pjE" , 



(26) 



where E' is the entropy based on Pj (the proba- 
bility that one of the k events occures) and E" 
is the entropy for outcomes k, given that j (one 
of them) occured. 

Maximum entropy methods generally maximize 
the information entropy subject to constraints on the 
probabilities, such as specified moments. 



B Log-normal assets 

The log-normal distribution with expected value e v 
is given by the probability distribution 



P(A X \I) = 



1 



x\/2ira 2 



(27) 



where a 1 is the variance of the distribution of log(ir). 
This yields for the diversification measure 



D(A\I) 



1 1 



log(27Rj 2 ) - u + (T 2 /2. (28) 



It is interesting to note that in contrast to the gaus- 
sian case (which depends solely on the variance) , the 
measure here also depends on the expected value. 
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C Effects of Skewness and Kur- 
tosis 

As the normal distribution is a maximum entropy dis- 
tribution, any other distribution with equal variance 
must have more diversification according to our mea- 
sure. Figure [5] shows the increase in diversification 
for maximum entropy distributions of given skewness 
and kurtosis for a fixed variance of a 1 = 0.25 2 . 




Figure 5: Plot showing the diversification index de- 
crease (from the corresponding value of the nor- 
mal distribution) for maximum entropy distributions 
of varying skewness and kurtosis for fixed variance 
(25%) (the undefined values have been set to zero.) 

D Comments on Weight En- 
tropy and Herfindahl Index 

The Weight Entropy and Herfindahl index are both 
diversification measures solely based on portfolio 
weights. We want to investigate the difference be- 
tween these measures for the two and three asset case. 

For a portfolio with two assets, the Weight Entropy 
(entropy) and Herfindahl Index (quadratic) measures 



are shown below in figure [51 Both are maximal for 
Wyn = 0.5. If the contribution to the overall measure 
from one asset only is considered, we see (in the lower 
panel) that large weights effectively give a negative 
contribution for the quadratic measure, while there 
are no negative contributions to the entropy measure. 




0.2 0.4 0.6 0.8 1 

wl 



Figure 6: Comparison of entropy and Herfindahl In- 
dex Q^ =1 (l/iV — w i) f° r two weights wi and W2 
(wi + u>2 = 1). The upper graph shows the overall 
measure and the lower graph shows the contribution 
of one weight. The measures have been re-scaled to 
cover the same range. 

Figure [7] shows the differences for the three asset 
case. There are large areas where the quadratic mea- 
sure (Herfindahl Index) understates diversification by 
up to 10% of the maximum value compared to the en- 
tropy measure. On the other hand, around w\ = 0.5 
and W2 = h(l — wi) = (and equivalently for wi = 
and W2 — 0.5) the quadratic measure overstates di- 
versification relative to the entropy measure. 

Both measures can be extended to the correlated 
gaussian case by considering the exposure to the 
"eigen- vector portfolios" of the correlation matrix in- 
stead of the actual asset weights. This, however, still 
does not account for varying risk profiles of the assets 
(or eigen- vector portfolios). 
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Figure 7: Comparison of entropy 

— X^i=i w i ]°s( w i)/ l°g(3) and re-scaled Herfind- 
ahl Index (§2i=i(V^ — f° r three weights. 

Here = Ml — wi), which defines /i in terms 
of the weights. Both measures coincide with their 
maximum at w\ = 1/3 and h = 0.5. Shown is 
the difference of both moeasures — red indicates a 
higher entropy measure of diversification relative to 
the quadratic measure. Values for h' > 0.5 are equal 
to values for h = 1 — h'. 
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